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ABSTRACT 


Previous molecular phylogenetic studies have recovered conflicting hypotheses of relationship 
among Juniperus (J), Cupressus (C), and Hesperocyparis-Callitropsis-Xanthocyparis (HCX). Conflict 
between nuclear genes, chloroplast genes, and nuclear and chloroplast data have all been realized in 
recovering all possible topologies among the three clades. In this study, we use 2.2 kb of aligned 
sequence from two nuclear loci, and 11.4 kb of sequence from 11 chloroplast regions, in re-examining 
relationships among J-C-HCX. Unlike previous studies, we find unambiguous support for relationships 
in the nuclear data, whether the genes are analyzed individually or in combination. In contrast, character 
conflict between different chloroplast partitions, or even between characters from a single region, results 
in nearly equally well-supported but conflicting hypotheses of relationship. Statistical tests of likelihood 
values indicate the chloroplast data always fails to distinguish between two of three competing sister 
group relationships, and in one instance cannot differentiate between any of the three possible J-C-HCX 
topologies. Results presented here suggest a complex evolutionary history in which molecular processes 
in addition to possible ancient hybridization have obscured J-C-HCX relationships. Published on-line 
www.phytologia.org Phytologia 97(1): 67-75 (Jan 2, 2015). ISSN 030319430. 
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Cupressaceae is the third largest gymnosperm family with over 130 species in about 33 genera 
(Farjon, 2005; Farjon et al., 2002; Adams et al., 2009). The family is well represented in both the 
northern and southern hemispheres, with members occupying all habitable continents and occurring in a 
variety of habitats (Farjon, 2005; Adams, 2014.) Rarity and high degrees of endemism are 
disproportionately represented in the family, with 18 genera being monotypic and 27 having five or fewer 
species (Farjon, 2005). Among the more diverse genera in the family are Juniperus (67 species, 34 
varieties), many species of which are adapted to semi-arid habitats in the northern Hemisphere, 
Cupressus, a genus of 12 species (Little, 2005) geographically centered in Asia (Mao et al., 2010) and 
generally known as the “Old World cypresses” (OWC), and Hesperocyparis, a recently recognized genus 
of 17 species (Adams et al., 2009; Adams et al., 2014; Wolf, 1948) from the western United States, 
Mexico, and central America (i. e., the New World cypresses or NWC). 


A spate of phylogenetic studies published over the last decade have resulted in new perspectives 
on the phylogeny of Juniperus, Cupressus, Hesperocyparis and related taxa (Little et al., 2004; Little, 
2005; Little, 2006; Adams et al., 2009; Yang et al 2012; Terry et al., 2012). The recovery and 
taxonomic recognition of Hesperocyparis as distinct from Cupressus (Adams et al., 2009), strong support 
for inclusion of Callitropsis and Xanthocyparis in a lineage with Hesperocyparis (1. e., the HCX lineage 
of Terry et al., 2012; Little et al., 2004; Little, 2006; Adams et al., 2009), and studies elucidating species 
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relationships (Terry et al., 2012) and the recognition of new species (Adams et al., 2014) within 
Hesperocyparis and Juniperus (Mao et al., 2010), collectively represent our improved understanding of 
evolutionary and taxonomic relationships in the group. 


Despite these advances, a number of outstanding questions remain. Among these are 
relationships among certain genera of Cupressaceae (Gadek et al., 2000; Yang et al., 2012), including 
those among Juniperus ( J), Cupressus (C), and HCX. One of the first studies to address relationships 
among Juniperus and Old and New world representatives of Cupressus was that of Gadek et al. (2000), 
which used molecular and morphological data in addressing relationships among the major lineages of 
Cupressaceae. Parsimony analysis of cpDNA sequences recovered a clade containing distinct Old and 
New World Cupressus as sister to Juniperus, 1. e., J (OWC, NWC) (Gadek et al., 2000). Two subsequent 
studies used cpDNA sequences to corroborate the findings of Gadek et al. (2000) in recovering a 
J(C,HCX) topology (Adams et al., 2009; Yang et al., 2012), while relationships among the three lineages 
were unresolved in a third study that used cpDNA (see Little, 2006). Three studies have used DNA 
sequences from a total of five nuclear loci in addressing relationships among J, C, and HCX. Two 
general patterns emerge from these studies: nrITS sequences always yield a C(J,HCX) topology (Adams 
et al., 2009; Little, 2006), and the other data sets, either alone or in various combinations, yield a 
HCX(J,C) topology (Little, 2006; Adams et al., 2009; Yang et al., 2012). Collectively, these findings 
indicate conflict between nrITS and other nuclear data sets (ABI3, 4CL, Needly, and Leafy) in resolving 
relationships among J-C-HCX, but in no instance are phylogenies derived from nuclear data congruent 
with those based on cpDNA, a finding some authors attribute to ancient hybridization (Yang et al., 2012). 


In this study, we re-examine relationships among J-C-HCX using nearly 13.7 kb of aligned DNA 
sequence from both the chloroplast and nuclear genomes. Results from separate analyses of the 
cytoplasmic and nuclear data as well as combined analyses are used to re-assess relationships among J-C- 
HOS 


MATERIALS AND METHODS 


Specimens used in this study with voucher information and GenBank accession numbers are 
provided in Table 1. For all specimens, one gram (fresh weight) of foliage was placed in 20g of activated 
silica in the field, and subsequently stored at -20°C in the lab. 


DNA extraction, PCR amplifications, and preparation of sequencing templates are according to 
Terry et al. (2012). Briefly, total genomic DNA was extracted from 0.020 g of silica dried leaf tissue 
using a DNeasy Plant Mini Kit according to the manufacturer’s instructions (Qiagen, Valencia, CA, 
USA). The psbD-trnT intergenic spacer and the trnC-trnD intergenic region containing spacer sequence 
and a portion of the psbM coding region were amplified and sequenced for two species of Calocedrus, 
four species of Juniperus, and three species of Cupressus (Table 1). All other sequences were previously 
published (Gadek et al., 2000; Little et al., 2004; Little, 2006; Terry et al., 2012) and are available in 
GenBank. Thermal cycling protocols for all amplifications were as follows: 94°C for 5 min, followed by 
30 cycles of 94°C for 1 min, 2 min at the optimized annealing temperature, and 72°C for 2 min, followed 
by 72°C for 7 min. Annealing temperatures were 47.5°C for psbD-trnT and 50°C for trnC-trnD. Primer 
sequences and other amplification details are given in Terry et al. (2012). PCR products were purified by 
agarose gel electrophoresis according to Terry et al. (2012) and sequenced at McLab Inc. (San Francisco, 
CA). 


Combining data from this study with chloroplast and nuclear sequences from GenBank produced 
13.7 kbp of aligned sequence from 9 noncoding chloroplast regions (8 intergenic spacers and one intron), 
2 chloroplast genes (rbcL and psbB), and 2 nuclear genes (nrITS and NEEDLY intron 2). For sequences 
published here, raw sequence from forward and reverse strands was assembled and aligned using Clustal 
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Omega (http://www.ebi.ac.uk/Tools/msa/clustalo/) or MAFFT. Computer generated alignments were 
manually refined using Seq-Al v.2.0a9 (Rambaut 2002). Both parsimony and Bayesian analyses were 
performed on each of three data sets: chloroplast data only, nuclear data only, and combined chloroplast 
and nuclear data. Parsimony analyses were conducted using PAUP*v.4.0b10 (Swofford 2002), with the 
heuristic search option in effect, simple stepwise addition of taxa, and TBR branch swapping, saving 
multiple trees. Branch support was assessed by conducting 1000 replicates of bootstrapping with the 
settings described above. Bayesian analyses were conducted using MrBayes 3.2.1 (Ronquist and 
Huelsenbeck, 2003) according to Terry et al. (2012). Best-fit evolutionary models were estimated for 
individual gene regions using the Akaike information criterion (AIC) implemented in (Model Test v.0.0.1 
(Posada 2008; Guindon and Gascuel 2003) using the default settings for likelihood calculations and the 
uncorrected AIC. Bayesian analyses were fully partitioned by gene region, with two independent runs of 
four Metropolis coupled chains each. Chains were generated from different random trees and run for 1 
million generations, sampling every 1,000th generation. In each run, three chains were heated using a 
temperature of 0.2 with one swap between chains every generation. The burnin fraction was enforced to 
0.2 using the ‘‘relburnin’’ command, resulting in the first 200 of 1,000 trees being discarded, and the 
remaining trees pooled to construct the posterior distribution of the phylogeny. A 50 % majority-rule 
consensus tree was produced using the ‘‘contype = halfcompat’’ command. Convergence and mixing 
were assessed by examining plots of likelihood against chain generation over the course of the run and by 
monitoring the standard deviation of split frequencies among runs in MrBayes. 


We statistically compared log likelihood values in assessing the relative support of the nuclear 
and chloroplast data for each the three possible J-C-HCX topologies. Three tests were performed; a 1- 
sided Kishino-Hasegawa (KH; Goldman et al., 2000), the Shimodaira-Hasegawa (SH; Shimodaira and 
Hasegawa, 1999), and the expected likelihood weights (ELW; Strimmer and Rambaut 2002). Each test 
was performed on each of three user defined trees, the two trees from parsimony analysis of the cpDNA 
only [(HCX(J,C) and J(C,HCX)], and the single tree from the combined parsimony analysis (C(J,HCX). 
Maximum likelihood analyses and statistical tests of fit were performed in Tree Puzzle 5.2 (Schmidt et 
al., 2002). Default settings were used in all tests except a gamma distribution with four rate categories 
was used in estimating rate heterogeneity. 


RESULTS AND DISCUSSION 


Parsimony analysis of the chloroplast data only produced two shortest-length tree of 759 steps 
(CI=0.90, RI=0.91; Fig. 1). Bootstrapping of these data produced strong support for most branches, but 
relationships among J, C, and HCX were unresolved in the 50% majority rule tree (Fig. 1). Of the two 
most parsimonious trees recovered, one was consistent with previous reports (Gadek et al., 2000; Adams 
et al., 2009; Yang et al., 2012) in recovering J(C,HCX), while the other recovered HCX(J,C). Bayesian 
analysis of the chloroplast data also recovered a HCX(J,C) topology, but while nearly all branches had 
posterior probabilities (pp) of 1.0, the J-C clade was weakly supported (pp= 0.62; Fig. 2) . 


In contrast to the chloroplast data, analyses of the nuclear data alone, or of combined nuclear and 
chloroplast data, consistently produced strong support for a clade containing J and HCX (Figs. 3-5). In 
addition, parsimony analysis of the nuclear data alone recovered four shortest length trees, all of which 
contained C(J,HCX), and strong support for a J-HCX sister group relationship (Fig 3). Similarly, 
parsimony analysis of the combined data produced a single tree of 1522 steps (CI=0.87, RI=0.90) in 
which a well-supported J-HCX clade was recovered (Fig. 5). Bayesian analysis of nuclear data alone or 
of combined nuclear and chloroplast data always recovered a C(J,HCX) topology with strong support 
(pp=1.0) for the J-HCX clade (Figs. 4 and 5). 


Maximum likelihood analysis using Tree Puzzle found the HCX(J,C) and J(C,HCX) topologies 
explained the cpDNA nearly equally well, while the C(J,HCX) topology produced the least likely 
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explanation of the data. All tests found no 
significant difference between the HCX(J,C) and 
J(C,HCX) topologies, and one of three tests (SH) 
found no difference among the three possible J-C- 
HCX alternatives (Table 2). 


Here, we re-examine relationships between 
Juniperus, Cupressus, and HCX with 13.7 kb of 
aligned DNA sequence. Our data include 2329 bp of 
aligned sequence from two nuclear genes (nrITS and 
Needly), and 11402 bp of sequence from 11 
chloroplast regions (Table 1). We consistently 
recover a C(J,HCX) topology from the nrITS and 
Needly data sets, analyzed either alone or in 
combination. This finding is supported by results 
from only one previous study (i.e. Adams et al., 
2009, which used combined nuclear and chloroplast 
data), and is in conflict with the HCX(J,C) topology 
recovered from analyses of several other nuclear loci 


Figure 1. 50% majority-rule consensus of two most parsimonious trees generated from analysis of 
chloroplast data only. Length = 759 steps, CI = 0.90, RI = 0.91. Numbers below branches are bootstrap 
values, and are not given for values less than 50%. 
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Figure 2. 50% majority-rule consensus tree generated from Bayesian analysis of chloroplast data only. 
Numbers below branches are posterior probabilities. 
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(Little, 2006; Adams et al., 2009; Yang et al., 
2012). Nevertheless, we find strong support for a 
J-HCX sister group relationship (bootstrap=100, 
pp=1.0) in all analyses including nuclear data 
(Figs. 3-5), and note that is no instance is nuclear 
data unable to statistically distinguish C(J,HCX) 
from either of the other two J-C-HCX 
alternatives (data not shown). Moreover, 
character analysis identified 42 synapomorphies 
for the J-HCX clade in the combined analysis, 35 
from the nuclear genes and nearly equally 
divided between nrITS and Needly (combined CI 
of 0.88), and 7 from the chloroplast data 
(CI=0.79). 


In contrast to the nuclear data, 
chloroplast sequences do not provide strong 
support for any particular hypothesis of J-C-HCX 
relationship. Perhaps this is best exemplified by 
results in which the chloroplast data never 
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Figure 4. 50% majority-rule consensus tree generated from Bayesian analysis of nuclear data only. 
Numbers below branches are posterior probabilities. 
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Figure 5. 50% majority-rule consensus tree generated from Bayesian analysis of all data. Numbers 
below branches are posterior probabilities. Parstmony analysis of all data produced one tree (1522 steps, 
CI=0.87, RI=0.90) having the same topology with respect to J-C-HCX relationships as the Bayesian 
consensus tree. The bootstrap value from parsimony analyses for the J-HCX clade is given above the 
branch. 


distinguish between J(C,HCX) and HCX(J,C), and for one test, cannot distinguish between any of the 
three possible J-C-HCX alternatives (Table 2). Neither bootstrap values (data not shown) nor posterior 
probabilities (Fig. 2) provide strong support for sister group relationships, and in the case of parsimony 
analyses, the number of synapomorphies and their consistency is nearly identical for conflicting sister 
group hypotheses (data not shown). Previous studies based on cpDNA were either unresolved with 
respect to J-C-HCX relationship (Little, 2006) or have found J(C,HCX) (Adams et al., 2009; Yang et al., 
2012), but in no instance has a well-supported C-HCX sister group relationship been recovered. 


Some authors have suggested Cupressus originated through hybridization between Juniperus and 
the common ancestor of HCX, an assertion based on conflict between different nuclear loci (Needly and 
Leafy vs. nrITS), similarity in Cupressus Needly and Leafy sequences to those of both Juniperus and 
HCX, and conflict between topologies derived from cpDNA (matK) and nuclear sequences (Yang et al., 
2012). Results present here are different from those of previous studies in that we find little or no conflict 
among different nuclear partitions in recovering a well-supported C(J,HCX). In addition, we find little 
support for J-C-HCX relationships in the cpDNA data, although the cpDNA data never supports a J-HCX 
clade, and C(J,HCX) is excluded from the other two alternatives in two of three statistical comparisons of 
topology (Table 2). Collectively, these findings suggest that if conflict in topologies supported by nuclear 
and cpDNA data is attributable to ancient hybridization (Yang et al., 2012), then other processes 
producing ambiguity in the chloroplast data, or conflict between different nuclear genes (Yang et al., 
2012), have also been important in the evolutionary history of the group. 
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TSSEPTOINH] og Idi, PLSLLTdMx|T L8SEZ0INH|T ST9PZOIWH] TTL T8864 V1 6PPPZOINH]T SZ TPZOINH|T'SZ0r7OINE | 786EZOINH|TOLTPCONNH]T 697886AV] TSS808EAV] ` WeUDTA ETEL UM) “ZS9OT sugpvlscdaoumem "7 
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Table 2. Results from maximum likelihood analysis testing the fit of the three possible J-C-HCX 
topologies to the chloroplast data. p-values are given under the test name and + indicates inclusion in the 
confidence set. 


